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ABSTRACT 

The assessment of dimensionality of data is important 
to item response theory (IRT) modelling and other multidimensional 
data analysis techniques, rne fact that the parameters froia the 
factor analysis formulatiou for dichotomous data can be expressed in 
terms of the parameters in the multidimensional IRT model suggests 
that the assessment of the dimensionality of the latent trait space 
can also be approached from the factor analytical viewpoint. Some 
problems connected with the assessment of the dimensionality of the 
.\atent space are discussfd, and the conclusions are supported by 
simulated results for sample sizes of 250 and 500 on a 15-item test. 
Five tables contain data from the simulation; and 48 graphs 
illustrate eigenvalues and plotted mean residuals. A 46-item list of 
references is includad. (^uthor/SLD) 
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Abstract 



The assessment of dimensionality of the data is not only 
important to item response theory (IRT) modelling, but has 
also been important to other multidimensional data analysis 
techniques. T..e fact that the parameters from the factor 
analysis formulation for dichotomous data can be expressed in 
terms of the parameters in the multidimensional IRT models 
suggests that the assessment of the dimensionality of the 
latent trait ^ace can also be approached from the factor 
analytical point of view. In this paper some problems 
connected with the assessment of the dimensionality of the 
latent space are discussed and the conclusions are supported 
by simulated results • 
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On the Assessment of Dimensionality in 
Multidimensional Item Response Theory Models 

The dimensionallL,' of a set of test items has been a 
source of debate in educational and psychological literature. 
The development of the well-known one-, two-, and three- 
parameter item response theory (IRT) models has increased the 
need of adequate tests for the unidimensionalit y of the 
latent trait space. See Hattie (1985) for a review of methods 
to assess unidimensionality , In many situations the 
assumption of a unidimensional latent trait is untenable and 
this has lead to the development of multidimensional latent 
trait models. As these models gained currency in educational 
research and their application became feasible by the 
development of computer programs like TESTFACT (Wilson, Wood 
^ Gibbons, 1984), NOKARMII (Eraser, 1988) and MAXLOG 
(McKinley & Reckase, 1983), the need for a measure to assess 
the adequacy of the model i*e. the assessment of the 
dimensionality of the latent trait space, increased- It is 
well-known that misspecif ications of the dimensionality of 
the model will severely change the estimates of the item 
parameters. Unfortunately no widely accepted index to 
identify the dimensionality of items is yet available. 

The dimensionality of data is not only an important 
issue in IRT modelling, but has also been important to other 
multidimensional techniques, such as factor analysis. 
Recently it has become known, that there is an important 
relation between the multidimensional IRT model and the 
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factor analysis model for dichotomous data. Takane and de 
Leeuw (1987) showed the formal equivalence of the marginal 
likelihood of the multidimensional two-parameter normal ogive 
model (Bock and Aitkin, 1981) and the likelihood of the 
factor analysis model for binary data (Muth6n, 1978) . This 
means that the IRT parameters can be expressed in terms of 
the parameters of the factor analysis formulation . An 
empirical comparison done by Knol and Berger (1990) showed 
that for the simulated conditions, common fa* tor analysis 
estimation procedures performed equally well as the marginal 
maximum likelihood estimation of the IRT parameters proposed 
by Bock and Aitkin (1981) . The estimates of the item 
parameters obtained from MINRES (Harman & Jones, 1966/ Zegers 
& Ten Berge, 1183) computed on tetrachoric correlations among 
the items, for example, were quite comp^.rable with the 
estimates obtained Irom TESTFACT. These results suggest that 
the problem of assessing the dimensionality in 
multidimensional IRT models for dichotomous data can also be 
approached from the factor analytical point of view* 

A number of empirical studies have been devoted to this 
problem. Ha^iibleton and Rovinelli (1986), Tucker, Humphreys, 
Lloyd and Roznowski (1986) and McDonald (1985) among others, 
considered various measures and Muth^n (1978) and Bock and 
Aitkin (1981) provided formal statistical tests. 

The purpose of the present study is to discuss some of 
the problems connected with the assessment of the 
dimensionality of a latent space and to compare some factor 
analytical methods for the assessment of dimensionality with 
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procedures available from multidimensional IRT models. First, 
however, a short description of dimensionality will be given. 



Multidimensional IRT models assume that an examinee is 
characterized by more than one latent trait. Let fl be the 
vector of m latent traits of an examinee, i.e. 



point in an m-dimensional latent trait space. If each of the 
traits influences the performance of the examinee on at least 
two of the n items in a test, then m is the dimension of the 
latent space. This definition of dimensionality is connected 
with the principle of local independence* The formal 
requirement of the independence of a set of item responses 
X = IX^, / Xj^], is that the joint distribution of the 

responses given a latent trait vector fi is equal to the 
product of the marginal distributions of the items given ft, 
i.e.: 



This means that if a population of examinees is characterized 
by m latent traits which completely span the latent space, 
then the responses of a 'sub' population of examinees with 
fixed values for ft are mutually statistically independent. 



The Dimensionality of the Latent Trait Space 



ft a [Bi, 62 , 



6j«] . An examinee can be represented as a 




(1) 
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If, however, a model specifies a number of latent traits, 
which do not completely span the latent space, then there 
will still remain mutual dependencies among the items for 
fixed values of fl. This is why McDonald {1989) concluded that 
ocal independence is not an assumption as such, but merely 
defines the dimensionality of th3 latent traits. In other 
words, the local independence principle is formally trivial, 
if no further conditions are added* In addition to locally 
independent models, Jannarone (1986) presented conjunctive 
locally dependent models. Although Suppes and Zanotti (1981) 
presented a process in which any locally dependent model can 
be replaced by an equivalent locally independent model and it 
may be concluded that all latent trait models are in fact 
locally independent models, this does not mean that the 
dependent models discussed by Jannarone (1986) do not provide 
meaningful alternatives to the more traditional independent 
models. However, dependent models will not be considered in 
this paper* 

The above definition of dimensionality of the latent 
trait space does not make a distinction between dominant and 
nondominant dimensions • For example, if some of the latent 
traits guide the performance on only a few items in a test, 
then these traits are not as important for the explanation of 
the total variance as traits that influence all items in that 
test. An example of such a dominant trait is a 'reading' 
trait for a verbal test composed of items on various topics. 
One of the problems encountered in assessing the 
dimensionality of the items is the existence of such dominant 
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(major) dimensions together with less dominant (minor) 
dimensions and this distinction may explain some of the 
conflicting results found in a number of studies on the 
assessment of dimensionality. 

Another problem with the traditional definition of 
dimensionality, is that the principle of local independence 
is really a very strong principle. Goldstein (1980) doubts 
whether this condition is actually met in real life data. To 
assess the dimensionality of items all higher order cross- 
product moments should be considered. Instead, the assessment 
of dimensionality is often restricted to the inspection of 
only first order correlations. McDonald (1981,1985) refers to 
this as a 'wealc' form of the principle of local independence. 
Since multivariate normal variables are mutually independent 
if, and only if all covariances are zero, this 'wea)c' form 
implies the 'strong' form of local independence under 
multivariate normality. 

Both above mentioned problems with the traditional 
definition have led Stout (1987, 1989) to relax the 
definition and to propose a more practical term, namely 
essential dimensionality which is based on the concept of 
essential independence. His definition of essential 
independence is as follows. Given a vector of latent traits 
9, the n responses in X of a sample of N examinees are 
essentially independent, if X satisfies 
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I Cov{Xj^,x^ie=fl) I 



0 



as n ~> oo 



(2) 




If the average covariance of the responses is small in 
magnitude/ then the responses of a ' sub' population with fixed 
fi's are said to be essentially independent, asymptotically. 
The essential dimensionality of an item pool is the minimal 
dimensionality necessary to satisfy the essential 
independence principle. This definition of dimensionality is 
easier to implement than the traditional one. Although all 
common measures for the assessment of dimensionality are 
implicitly based on this definition, only the measures based 
on "residuals" are a direct implementation of this principle- 

Generally procedures to assess dimensionality can be 
grouped into the following four categories; 

- Procedures based on formal statistical tests for the fit of 
a model. 

Procedures based on information theoretic model selection 
criteria , 

- Procedures based on the total amount of explained variance- 

- Procedures based on "residuals" after a model has been 
fitted to the data. 

In the following sections each of these procedures and the 
problems connected with their use^ will be described, 



Finally, some results based on simulated data will be given 
for each of these procedures. 
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Formal Statistical Test for the Fit of a Model 

Bock and Aitkin (1981) proposed to apply the multiple factor 
analysis model to dichotomous data. They assumed that an 
unobserved response process yijr ^or person 1 and item j is a 
linear function of m normally distributed latent variables 
9i^ ^^il, ®i2r-'-' ®iml factor loadings 

[^jlr ^j2'''' ^jm^ ' i.e.: 

Yij " ^jl®il ^ ^j2®i2 + ^jm^im * ^ij- <3) 

®ik ability of person i on ability dimension k^ Xj)^ is 

the loading of item j and dimension k and Vj^j are error 
terms; V^j * MVN(fl,r2)^ where is diagonal matrix with 

positive elements ^j^* contrast to classical factor 

analysis for continuous data the unobseivable process y^j is 
connected to the dichctomous response variables Xj^j by the 
following mechanism: 

if yij ^ Tj, then x^j = 1 
otherwise Xj^j « 0 , 

where Yj is a threshold parameter connected with item j. 

The probability of answering an item j correctly, given the 

ability vector fij^ for person i is 
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where O is the cumulative standard normal distribution. To 
obtain the parameter estimates. Bock and Aitkin (1981) 
developed an iterative marginal maximum likelihood estimation 
procedure based on the EM algorithm due to Dempster, Laird 
and Rubin (1977). Vne procedure was implemented in TESTFACT 
by Wilson, Wood and Gibbons (1984) . 

If the sample consists of N penons with responses on n 
items, then there are s !S irin(2^,N) distinct response 
patterns. The estimated joint probability of the response 
patterns is: 

where rj^ is the frequency of response pettern 1 and P]^ is the 
estimated marginal probability of the response pattern 
computed from the item parameter estimates. The well known 
likelihood ratio statistic is; 

Gm^ - - i,Vl ^" " m fi ] , (6) 

1 = 1 N l~i. 

with dfj«^ s - n{m+l) ^ m(m-l)/2 aegrees of freedom. 
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By means of this statistic two different test procedures can 
be carried out to decide the smallest number of factor such 
that the model fits the data. 

The first procedure is based on a sequence of tests of 
the hypothesis H^*: m *^ ro* against the alternative m > m*, 
for a stepwise increase of the number of factors 
m*« 1, n. The procedure stops when 

The second procedure is based on Haberman's (1977) 
result that the difference of two statistics for alternative 
models is asymptotically chi-square distributed with degrees 
of freedom equal to the difference of the degrees of freedom 
corresponding to each of the two statistics The sequence of 
tests of the hypothesis Hj^^* : m = m* against the alternative 
=; ni* ^ 1 for m* « 1, n-1, stops when: 

G^iff - - ^ Xi_a, (n-m)- 

This procedure is especially useful when some of the expected 
cell frequencies are almost zero. This may often be the case 
when N < 2". 

Both procedures can be carried out by the TESTFACT 
program and Bock, Gibbons and Muraki (1988) concluded from an 
empirical comparison that these procedures seem adequate in 
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selecting the number of factors • There are, however, some 
problems connected with these test procedures. 

The first problem is that of the asymptotic 
distribution. Kendall (1977) has pointed out that Pearson's 
chi-square and the likelihood ratio statistic are often 
regarded as equivalent becajse of their asymptotic 
properties- In practice, however, the results may be very 
different • Although maximum likelihood estimators of 
parameters retain standard large sample properties under 
various conditions, the large sample approximation of chi- 
square statistics are often unacceptable. Moreover, the 
asymptotic test for the fit of a model is generally very 
sensitive to sample size- For relatively small samples the 
procedure is unreliable, because of the small expected cell 
frequencies and for large samples the procedure a]<most always 
favours the alternative hypothesis (McDonald, 1989) . 

A second problem is the choice of the level of 
significance. The adequate selection of the number of factors 
is in fact a multiple decision problem. In general, a whole 
family of hypotheses t2 s { Hj^liel } can be considered, where 
I is an index set. Corresponding with each hypothesis 
there is a real valued statistic . The family of statistics 
G » { G^liel ) and the family of hypotheses SI together form a 
a so-called testing family [fl , G] . The overall probability a 
of making type I errors is unknown, and some sort of 
simultaneous test procedure (STP) will be needed to control a 
for the whole testing family. Although a likelihood ratio 
based STP is available for the testing of association in a 

er|c 1 6 
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contingency table {Gabriel, 1969) , there is no STP available 
yet for the fit of a multidimensional IRT model. Some sort of 
correction, like the one proposed by Aitkin (1979) may be 
useful. His correction is based on the fact that the upper 
limit of the overall c for n tests with level of significance 
OLi is {1 - (1-ai)^}. 

A third problem is the coherence of the procedure. The 
hypotheses Hj^ for the testing of the dimensionality of the 
latent trait space are ordered. A two-dimensional model, for 
example, will explain more variance than a one^dimensional 
model. If the hypothesis of unidimensionality of the latent 
space is accepted, a coherent test procedure will also 

accept the hypothesis H2 that there are two dimensions. 
Gabriel (1969) proved that for likelihood ratio based 
statistics the testing family is monotone, i.e. whenever 
two hypotheses Hj^ and Hj are ordered (nested) such that i<j/ 
the numerical relation g| S G^ holds for (i,j e I). 
Increasing the number of factors in the model will decrease 
the values of the likelihood ratio statistics for the fit of 
the model. Note that tests based on the union intersection 
principle are also monotone, but that Pearson's chi-square 
statistic is not (Gabriel, 1969) . 

The monotonicity of the testing family, however, does 
not necessarily indicate that the testprocedure is coherent . 
Coherence prevents the contradiction of rejecting a model 
with say m factors without also rejecting all models with 
less than m factors. 
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To illustrate the above mentioned', problems the results 
for the BOck and Lieberman (1970) LSAT-T data are given in 
Table 1, It seems difficult to draw conclusions. For a 
significance level = 0 • 10^ the results give evidence of a 
two factor model. If the relative large sample size is taken 
into account and a smaller is selected, a^^ = 0,01, then a 
one factor model seems more appropriate. The non-coherence of 
the test procedure is demonstrated by the fact that the p- 
values do not always increase as the number of factors 
increases. Although this example does not actually show an 
incoherent decision^ it can be seen that the possibility 
exists . 



Insert Table 1 about here 



A Model Selection Criterion 

Since we are dealing with a model identification problem and 
inferences are based on maximum likelihood, it is suggested 
to assess the dimensionality of the latent trait space by a 
model selection criterion which does not have the above 
mentioned disadvantages of the formal test procedures, 

Akaike (1974) developed an information theoretic 
criterion for the identification of optimal and parsimonious 
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models in data analysis. Akaike's information criterion for 
model (4) takes the form of a maximized likelihood and is 
defined as: 

AlC(m) a -2 In [ L^^ (fi,^, ^ Yj ) ] + 2 kj^, (9) 

where (fi/ L Yj) is the maximized likelihood and kj^ is the 
number of independent parameters in the model. {-AIC(m)/2) 
is an unbiased estimate of the mean expected log likelihood. 
The first term of AIM(m) is a measure of badness of fit. The 
term 2kj^ is the penality term correcting for overfitting due 
to the increasing bias in the first term as the number of 
parameters in the model increases. The model with the minimum 
AIC (m) value is chosen to be the best fitting model. Bozdogan 
and Ramirez (1988) and Ichikawa (1988) showed that the 
criterion worked quite good in assessing the number of 
factors in ML factor analysis. A review of the general theory 
is given by Akaike (1987) and Bozdogan (1987). 

To assess t" . dimensionality, it will be convenient to 
formulate a criterion comparing the likelihood of the model 
with m factors with that of the saturated model. The log of 
the maximized likelihood for the normal ogive model is: 

s s 

In ihm) « In N! f I ri In Pi - £ In (ri!). (10) 
^ 1-1 ^ ^ 1«^ ^ 
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Th* number of independent estimated parameters is 
n (m^^l) -m(m-l) /2. Subtraction of this expression from that of 
the log of the likelihood for the saturated model with 
s 5 min (2^,N) different response patterns, results in the 
following AIC expression: 

s r ^ 

AlC(m) =2 [ I ri In — - I ri In gi] - 2 (dfj^) - (11) 
1-1 N 1^1 ^ 

A minimum value for the AIC (m) criterion will indicate the 
'true' dimensionality. In Table 1 the AlC(m) values are given 
for the LSAT-7 data. These values show that a 2-f actor normal 
ogive model would probably be the best fitting model. 



Procedures Based on the Proportion of Explained Variance 



The determination of the number of components in principal 
component analysis and in factor analysis is often based on 
the amount of explained variance, i,e, based on criteria 
formulated on the eigenvalues of a correlation matrix. Among 
these procedures are the well known Kaiser's (1960) 
eigenvalue greater than 1.0 rule, the scree test of Cattell 
(1966) and the parallel analysis method (Horn, 1965) , A huge 
amount of research has been done on these methods* Although 
there are sometimes conflicting conclusions, the main trend 
seems to be that parallel analysis and the scree test perform 
quite good for continuous data (Zwick & Velicer, 1986) . 
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Similar methods have also been proposed for dichotomous 
data. Collins, Cliff, McCormick and Zatkin (1986) 
investigated the modified scree test on phi and tetrachoric 
correlations and concluded that both procedures perform 
poorly. Although they prefer the analysis with phi- 
correlations, Green (1983) and Hambleton and Rovinelli (1986) 
among others, found that phi coefficients produce spurious 
factors based on the difficulty level of the binary items. On 
the other hand, Drasgow and Lissak (I9b3) found that a 
modified parallel analysis was quite good in detecting the 
unidimensionality of dichotomous data. Tucker, Humphreys, 
Lloyd and Roznowski (1986) compared some eigenvalue indices 
and found that these indices did not work very well. Recently 
Bernstein and Teng (1989) questioned the application of 
eigenvalue criteria to categorical data. They concluded that 
false evidence of mult idimensionality is often found. These 
results do not seem to be very encouraging. Moreover, there 
are some problems in applying eigenvalue criteria to 
dichotomous data. 

The first problem with these criteria is the choice of 
correlation coefficient. Phi coefficients generally produce a 
positive definite correlation matrix and tend to avoid the 
problem of Heywood cases. On the other ^and, phi coefficients 
tend to overestimate the number of underlying dimensions. The 
use of tetrachoric correlations will produce a more reliable 
estimate of the underlying dimensions, but the sample based 
estimate of the correlation matrix is often not positive 
definite and tends to produce more Heywood cases- Although 
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fast and generally accurate computational algorithms exist 
(Divgi, 1979) the coefficients will become unstable when 
extreme values are reached. Morec/er, the use of tetrachoric 
correlations is inappropriate when ability distributions are 
not normal and the item response function is not normal ogive 
(Lord, 1980) , Of course, the problem of non positive 
definiteness can be avoided by smoothing the correlation 
matrix. But this may change the pattern of the eigenvalues, 
especially when n is relatively small. 

P.nother problem which is not generally recognized, is 
that the pattern of the eigenvalues depends on the size of 
the discrimination parameters of the items. Under the 
assumptions for model (3), that the abilities fl^ - MVN(^,I) 
and the errors 12j MVN (i, F^), the covariance matrix of the 
unobservables y^j is: 

= AA' + r2. (12) 

The nxm matrix of factor loadings is A and is a diagonal 
matrix with unique variances Cj^. Under multivariate 
normality the population correlation matrix is: 

R- (Diag(AA')+r2r^^^ (AA'+r2] [Diag (AA' ) +r2 ] (13) 
Equation (13) can also be written as: 

R= [1+ Diag(AAM AAn [ I+Diag (AAM ] (14) 
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where A is an nxm discrimination parameter matrix and relates 
to A by A« r^^A. It can be seen, that an increase of values 
of the discrimination parameters will generally increase the 
correlations . 

There is no analytical expression available for the 
eigenvalues of an arbitrary correlation matrix, but from a 

A 

theorem due to Gersgorin (see Pullman, 1976) an upper limit 
of the largest eigenvalue e^^^ of an arbitrary positive 
definite matrix can be derived: 



®max ^ inax{ r^lr^jl , i*l,..*,n) , (15) 



where r^j is an element of R« This upper limit is generally a 
good approximation of the largest eigenvalue of a correlation 
matrix (Morrison/ 1976) , Increasing the discrimination values 
will increase the correlations among the items and will 
increase the largest eigenvalue. Since e^ = n, increase of 
the largest eigenvalue will generally decrease the other n-1 
eigenvalues - 

In conclusion^ the pattern of the eigenvalues of the 
correlation matrix is influenced by the size of the 
discrimination parameters of the items and a procedure based 
on the pattern of the eigenvalues may lead to erroneous 
conclusions about the dimensionality of the latent trait 
space . 
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Procedures Based on Residuals after Fitting the Model 

One of the conclusions drawn by Hattie (1984) and Ham£>leton & 
Rovinelli ' 36) was that residuals obtained from nonlinear 
factor analysis could very well determine the correct 
dimension of a latent trait space and McDonald (1981,1989) 
recommended the use of the mean (absolute) residuals. 

If Aj^ is the nxm estimated matrix of factor loadings 
from a solution with m estimated common factors, and R is the 
tetrachoric correlation matrix, then the off-diagonal 
elements of the matrix: 

R^ = R - A^Am' / <16) 

are the residuals ^ij* The mean squared and mean absolute 
residuals are: 



f.= 2(n{n-l) ]-l r I (rj4)2 
^ i<j 

and 

f^= 2[n(n-l) I I IrJ J , (17) 

i<j ^ 



respectively. Since the squared residuals fi are generally 
more sensitive to outliers than absolute residuals f2/ only 
the results for f2 will be reported in this study. 

It has already been mentioned, that the mean absolute 
residuals is an implementation of the definition of essential 
dimensionality by Stout (1987, 1989) , The only problem in 
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applying this criterion is that it may not be clear when the 
value of the criterion is small enough. A possibility is to 
compare the f2 criterion after the fit of an m-dimensional 
model with values of this criterion from random data. 

Conditions for the Simulation Study 

To cort^'^'re the performances of the different measures binary 
data matrices were generated for known difficulty and 
discrimination parameters. Though several values for these 
parameters were considered, the results of only two matrices 
with different discrimination parameter values will be given. 
The two matrices are: 



1.0 0.5 0.0 
1.0 0,0 0.5 
L 0.0 0.5 1.0 



(5x) 

(5x) Ajj 
(Sx) 



2.0 2.0 0.0 
0.0 2.0 2.0 
2.0 0.0 2.0 



(5x) 
(5x) 
(5a) 
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The notation in brackets (5x) indicates that groups of 5 
items have the same pattern* The sample sizes were set equal 
to N = 250 and N ^ 500, and the number of items was n - 15. 
Each group of 5 items with the same discrimination parameter 
values has difficulty parameter values -2, -1^ 0< 1 and 2. 
Generation of data from a one-dimensional and a two^ 
dimensional latent trait model was done by using the first 
and the first two columns of matrices and An, 

respectively * The actual generation of the binary data for 
the multidimentional IRT model was done by using the random 
number generator of the NAG (1984) program library and the 
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sample tetrachoric correlations were computed by a 
modification of the Divgi (1979) procedure. 

Results 

For almost all conditions of the simulation study the test 
procedure erroneously favoured the alternative hypotheses* 
The results for the G^diff test procedure are given in Tables 
2 and 3- 

These results indicate that the test procedure is often 
not capable of locating the correct dimension of the latent 
space* The results in Table 2 show that the test procedure is 
not capable of locating the minor third dimension* It must be 
emphasized that these results are based on only 10 simulated 
data matrices per condition and that more simulations will be 
needed to draw firm conclusions* The huge amount of computer 
time needed by TESTFACT prevented us from expanding these 
simulations . 



Insert Tables 2 and 3 about here 



The results for the AIC Criterion are given in Tables 4 
and 5* Although there are misspeclf ications, it can be seen 
that the AIC criterion tends to locate the correct dimension 
of the latent space somewhat better than the X^iff procedure* 
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Again it must be en^haaized/ that the number of siiuulated 
runs is small/ due to the large amount of CPU time needed to 
run TESTFACT* 



Insert Tables 4 and 5 about here 



Figures 1 and 2 show the eigenvalue patterns for the two 
discrimination parameter matrices Aj and Ajj. The points in 
the Figures indicate the means of the eigenvalues for 50 runs 
and the vertical lines from the plotted points indicate the 
range of the eigenvalues. The almost horizontal line is the 
eigenvalue pattern for random data matrices. The Figures show 
that only parallel analysis seems adequate, although the 
minor third dimension in Aj is not recognized. 



Insert Figures 1 and 2 about here 
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Residuals have been obtained from various estimation 
procedures. In the Figures 3 through 8 the results are given 
for the common factor analysis procedure MINRES performed on 
the matrix of tetrachoric correlations, the residuals 
obtained from TESTFACT and the residuals obtained after an 
unweighted least squares approximation of pairwise 
proportions due to McDonald (1985) and computed by NOKARMII . 



0 i 
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The computer program MAXLOG, with joint ML es miration was 

also considered. The residuals from MAXLOG, however, were not 

very adequate, probably because of the drift of the 
discrimination parameters - 



Insert Figures 3 through 8 about here 



The pattern of the residuals show a relatively large 
drop of f2 after a model with the correct dimension of the 
latent traits is fitted to the daca. Comparison of the f2 
values with those from random data indicate that all 
procedures perform quite well and that residuals from 
NOHARMII even were able to discover the minor third dimension 
in Aj . 

Summary and Conclusions 

The assessment of dimensionality of the latent traits is 
a difficult problem and each of the discussed measures has it 
own disadvantages. In this paper the assessment of 
dimensionality is approached from the factor analytical point 
of view. Although the results from this study do not indicate 
that me of the measures performs best in assessing the 
dimensionality of a latent space, the results seem to 
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indicate that some of the measures are not very good. The 
following conclusions may be drawn from this study: 

- Asymptotic tests for the fit of a latent trait model are 
not very reliable. The AIC criterion seems to perform 
somewhat better/ but more research will be needed to draw 
definite conclusions. 

- The structure of the matrix with discrimination parameters 
is crucial for the performanct of the eigenvalue measures, 
because the size of the discrimination parameter values 
determines the explained variance of each factor in the 
model. 

-The disadvantages of the eigenvalue criteria and their 
performance in this study lead to the conclusion that these 
measures should be avoided. Parallel analysis, however, did 
perform quite good. 

- The "residual** measures from TESTFACT, MINRES and NOHARMII 
performed quite good/ and even a minor dimension could be 
located by NOHARMII. 

Finally, it must be noted, that the conditions of this 
study were limited and that the results from TEbTFACT were 
based on a small number of runs. More simulations will be 
needed to drawn more definite conclusions. 
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Table 1 

Chl-aQuarf » fit-.atiatinfl and AIC Criterion for LSAT-7 data 
(N=1QQQ. n°5. quadrature PQint3°7) 
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Table 2 

Percentaaaa of decisions about the dimenaionalitv of the 
latent tra i^ aoace for Aj with the 0^4^^ Proceduge ^a^^iLJm 
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Table 3 

Percentagea of decisiona About thft dimengionalitv of thp 
lal^ent ^r ^it space for Aj j wlttl tbS Gdi ff ggOCCdure (glZfl-Jl^ 
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Table 4 

Parcentaff f^a of f^eifliona about the dimensional itv of the 
i j^t-«»nt tra it space for with the AlC(m) criterion 
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Table 5 

Percentaqiefl of dp riaions about the dimensionality of the 
lat.Pnt trait space for with the AIC (m) criteria 
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Figure Captions 

Figure 1. Eigenvalue plots for matrix of discrimination 
parameters Aj- 

r ; iqure 2, Eigenvalue plots for matrix of discrimination 
parameters Ajj* 

f igure 3. Plotted mean absolute residuals for Aj and MINRES. 
Fiav^re 4, Plotted mean absolute residuals for Ajj and MINRES. 
Figure 5. Plotted mean absolute residuals for Aj and 
NOHARMII, 

yiffure 6. Plotted mean absolute residuals for Ajj and 
NOHARMII. 

Figure 7- Plotted mean absolute residuals for Aj and 
TESTFACT. 

ri,gi 4^e 8. Plotted mean absolute residuals for Aj and 
TESTFACT, 
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